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Abstract 

In a model of a connected network on random points in the plane, 
one expects that the mean length of the shortest route between vertices 
at distance r apart should grow only as O(r) as r — + oo, but this is not 
always easy to verify. We give a general sufficient condition for such 
linearity, in the setting of a Poisson point process. In a L x L square, 
define a subnetwork Ql to have the edges which are present regardless 
of the configuration outside the square; the condition is that the largest 
component of Ql should contain a proportion 1 — o(l) of the vertices, 
as L — > oo. The proof is by comparison with oriented percolation. We 
show that the general result applies to the relative neighborhood graph, 
and establishing the linearity property for this network immediately 
implies it for a large family of proximity graphs. 
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1 Introduction 



The phrase random networks encompasses a very wide range of mathemati- 
cal models and real-world motivations. We will consider the spatial network 
setting where the vertices are points in two-dimensional space. In ordinary 
language, "network" often carries the implication of being connected, so 
it is ironic the the two most classical models for (infinite) random spatial 
networks are not connected. These are 

(i) [Random geometric graph |16]]. Start with a Poisson point process of 
vertices; put an edge between vertices i, j if the Euclidean distance d(i,j) is 
less than some prescribed constant c. 

(ii) [Bond percolation [10 J. Start with the square lattice; retain or delete 
edges at random. 

A third class of models is connected: 

(iii) [Small world models [2]] . Start with the usual square grid of vertices 
and edges; add extra edges with probability p(i,j) for prescribed p(-). 

However this third class of models is designed for settings where the 
relevant notion of path- length is "number of edges in the path" . We will 
be concerned with the "purely geometric" setting where the notion of route 
length is "sum of Euclidean lengths of edges in the route", so there is a 
network distance between any two vertices, defined as the minimum (over 
all routes connecting the vertices) route length. In contexts where networks 
are designed to be connected and network distance is a topic of interest, using 
mathematical models which are not a priori connected is both conceptually 
unsatisfying and technically complicated, in that one is forced to consider 
quantities such as "mean network distance conditioned on the two vertices 
being in the unique infinite component" . 

This paper is a technical contribution to a program (see [3] for a survey) 
studying connected networks on random points, where we view the infinite 
Poisson process as proxy for a large finite unstructured set of points. The 
main result concerns the very general class of CIDRPP networks that we will 
now specify, after apologizing for the ugly name, and after mentioning a ter- 
minological convention we find helpful. Visualizing road networks, we write 
city and road and route for the objects in a network model we are studying; 
and write vertex (site) and edge and path for mathematical graphical objects 
constructed in the course of the proofs. 

The PP denotes Poisson process, precisely the rate 1 per unit area Pois- 
son point process used as the model for city positions. The DR denotes 
deterministic rule for roads; given the configuration of cities, whether or 
not a straight road links the two cities is determined by some deterministic 
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rule, which need not be "local" (it may depend on the entire configuration) 
but is required only to be invariant (i) under translation. (Note that us- 
ing simple random rules would tend to leave some cities isolated and hence 
disconnect the network). The resulting IDRPP network is a translation- 
invariant process in the plane, and the "random geometric graph" in (i) is 
perhaps the simplest example. Finally we want the network to be connected 
((7), and adding that requirement gives the class of CIDRPP networks. The 
best known example is perhaps the Delaunay triangulation [9] on the PP; 
another example is the minimal spanning tree (MST) on the PP (see sec- 
tion 3.4). But more interesting for our purposes is the relative neighborhood 



graph (RNG) [TT] defined by 

there is a road between two cities x, y if and only if there is no 
other city z with max(d(z,x),d(z,y)) < d(x,y) 

where d denotes Euclidean distance. See Figure 6 for an illustration. This 
particular network is interesting because (loosely speaking) it is the sparsest 
connected graph that can be defined by a simple local rule; more precisely 



(see section 3.2 ) there is a family of proximity graphs which are supergraphs 
of the RNG, and therefore can only have shorter network distances. 

The examples above are planar graphs, for which the notion of route is 
unambiguous. Our results will apply to non-planar networks, if we make 
the convention that if a road (xi,X2) crosses a road (£3,^4) then there is 
a route (xi,x^) via the junction. In most natural examples the DR is also 
rotation-invariant, but we do not need to assume that. 



1.1 The linearity property 

Now consider some CIDRPP network. How should we formalize the "lin- 
earity" property 

mean network distance between vertices at Euclidean distance r 
apart should grow only as 0{r) as r — > 00. 

By familiar properties of the PP, conditional on cities at prescribed points 
z l ,z 2 in M 2 with z 1 — z 2 = z, the network distance between them is dis- 
tributed as if we "planted" cities at the origin and at z, other cities being 
distributed as the PP, and considered the network distance T z between the 
planted cities. Here T z is random with \z\ < T z < 00. So we could formalize 
the linearity property as 

ET Z 

limsup — — < 00 as \z\ — > 00. (1) 
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Unfortunately it seems technically hard to deal with this definition at the 
level of generality we seek, because the deterministic rule for edge presence 
may depend on the entire configuration. Instead we shall use the "integrated 
out" analog. Denote by £ the cities in the PP, and write £') for network 
distance. Say a CIDRPP network has the linearity property if, for any choice 
of bounded subsets A, B of M 2 , 

E E E *(£,0 = O(M) «* 1*1 ( 2 ) 

Here z + B := {z + z' : z' € B}. 

Clearly it is enough to prove Q when A and B are squares (of some 
one arbitrary given size) centered at the origin; by considering very small 
squares we see this definition is intuitively similar to ([I]). 

A glance at Figure 6 suggests that the RNG has this linearity property, 
and Monte Carlo simulations |3 suggest a limit of about 1.38 in Q. But we 
do not know any proof which is "constructive" , in the sense of first describing 
an algorithm for choosing a route and then bounding the mean length of 
the chosen route. Rather than seeking some non-constructive proof tied to 
this particular model it seemed more useful to seek a more widely applicable 
result, and this consideration was the motivation for this paper. It is clear 



(intuitively, at last - see section 3.4 1 that the MST does not possess the 



linearity property, so we need to impose some further assumptions on a 
CIDRPP network to ensure the linearity property. 

It turns out that one condition is sufficient. Consider the L x L square 
[0,L] 2 . Then consider the subnetwork Ql defined in words as 

the cities in [0,L] 2 , with the roads that are present 
regardless of the configuration of cities outside [0,L] 2 . (3) 

To say this more formally, consider a deterministic configuration of cities in 
the plane, write x for the configuration of cities inside [0, L] 2 , and x±, X2 for 
two such cities, and write y for the configuration of cities outside [0,L] 2 . 
The deterministic rule defining the network is an {edge, no — edge}- valued 
function f(x\, x 2 , x, y) specifying whether to put an edge between x\ and 
X2- Define a new function 

/(xi,x 2 ,x) = edge iff f(x 1 ,x 2 ,x,y) = edge for all y. 

Now Ql is the network obtained by applying rule / to assign roads to a PP 
of cities inside [0,L] 2 . By construction, Ql is a subnetwork of the original 
network in the infinite plane. 
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The subnetwork Q l need not be connected, so write N® for the number 
of cities inside [0, L] 2 that are not in the largest component oIQl- Consider 
the asymptotic essential connectedness property 



Lr' A ENl -» as L -» oo. (4) 



Theorem 1 If a CIDRPP network satisfies the asymptotic essential con- 
nectedness property Q then it has the linearity property M). 

We prove this in section [2] by comparison with oriented percolation. This 
general method is well known in the theoretical spatial random processes 
literature, and is indeed a central focus of the 1988 monograph of Durrett 

H- 

Proposition [9] in section [3] checks that the relative neighborhood graph 
satisfies the assumptions, and hence the conclusion, of Theorem [T] 

1.2 Remarks 

Let us conclude the introduction with some brief remarks. 

1. We should acknowledge that Theorem [T] is "purely theoretical" in that 
the argument gives very large bounds. Obtaining numerically reasonable 
bounds in networks like the RNG seems a hard problem. 

2. Our linearity property Q can be viewed as the average-case analog 



of the worst-case concept of a spanner, described in section 3.2 

3. A counter-intuitive result [2] says that, if one seeks to design a network 
over a PP with only the two goals of minimizing ET 2 (defined above (|T|) for 
large \z\, and minimizing mean road length per unit area (a, say), then one 
can construct networks such that 

(i) |z| _1 Er 2 -» 1 as \z\ -> oo 

(ii) a is arbitrarily close to a ST , the value for the Steiner tree, the smallest 
value for any connected network. 

Consequently, while Theorem [T] is intended to be useful for establishing 
linearity in networks that are given to us, it isn't needed for networks that 
we are allowed to design. 

4. Where ([I]) holds it is natural to conjecture that for fixed 9, taking 
z = (r, 9) in radial coordinates, the limit 



lrm 



M) 
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always exists, and that this property (analogous to the shape theorem in first 
passage percolation) should be easy to prove by subadditivity. But counter- 
intuitively, existence of the limit does not seem easy to prove; a result of this 
type, formalized by an analog of Q, will be given in a companion paper [I]. 

5. For technical reasons the hypothesis of [1] uses a "L 2 " analog of the 
"L 1 " linearity property ([2]), which in fact (section 2.7) follows from the proof 



of Theorem [T] we record the general fc'th moment form as 

Corollary 2 Under the hypotheses of Theorem^ for each k > 1 and each 
choice of bounded subsets A, B ofM 2 , 

T max(l,M*) < °°- (5) 



2 Proof of Theorem Q] 

The argument uses a very standard methodology in percolation theory: a 
block construction and a comparison with oriented percolation. The closest 
explicit results in the percolation literature are those such as [5] studying 
network distance within the infinite component and giving bounds which 
hold for all p in the supercritical regime. But several difficulties arise in 
applying such results directly in our setting. Because we are bounding an 
expectation we can't simply ignore events of probability — ► 0; and we need 
information on distance-related quantities for all pairs of sites, not just the 
"open" sites. 

Anyway, instead of trying to exploit sophisticated results from percola- 
tion theory we shall give an argument using only one fundamental result, 
Proposition [3] below. We outline the overall argument in section 2.3, after 



giving the block construction in section 2.2 



2.1 Comparison with oriented percolation 

Following Durrett [7j we set up a tilted square grid, the graph C = (V, £) 
with vertices (sites) V = {(ni,7i2) : rii £ Z,ni + n,2 even} and edges 
(ni,rt2) — (ni db 1,U2 + 1). A doubly-infinite oriented-up path is a path 
{(m(n), n), —00 < n < 00} where m{n + 1) = m(n) =t 1. 

Fix < p < 1. A 1-dependent p-site process is a random process in which 
each site v £ V is declared good or bad is such a way that 
(i) P(v is good) = p 
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(ii) for disjoint subsets V\, V% of V such that there is no edge (v\,V2) with 
Vi G Vi, the family of events {v is goodjueVi is independent of the family 
{v is good} ue y 2 . 

In such a process, a good path is a path through only good sites. Our 
argument is based on the following standard result, illustrated in Figure 1. 

Proposition 3 There exist po < 1 and finite constants > 1) such 

that a 1-dependent p-site process with p > po has the following property. 
Let T > 2 be the smallest even positive integer such that site (T, 0) is in a 
doubly-infinite oriented-up good path. Then KT k < for all k > 1. 

This result is given in Durrett [7J section 10 by a "contour argument". (In 
fact [7] does the semi-infinite case, but the doubly-infinite case is similar.) 




Figure 1. Black discs denote good sites. The arrows indicate edges of one 
doubly- infinite oriented-up path starting through (T, 0). 



2.2 The block construction 

Fix large L. Write Cl = (Vl,£l) for the tilted square grid scaled by L, so 
its vertex-set is Vl = {(Lni, Lwi) : n» S Z, rti + n2 even}. Write V* L for the 
set of midpoints of edges of Sl- For each v G Vl U V* l define the following 
objects. 

S v is the open L x L square centered at v; 

Gv is the subnetwork Q in S v ; that is, the roads within S v that are 
present regardless of the configuration of cities outside S v ; 

N\(G V ) is the number of cities in the largest component of G v ; 
len(Q v ) is the total length of the roads of Q v . 

Note that the squares (S v ) v( ,y L are disjoint, as are the squares (S v ) ve y* . 
Call v £ Vl and v* £ V* L diagonally adjacent if v* is the midpoint of some 
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edge at v, in which case S v C\S V * is a L/2 x L/2 subsquare of S v . See Figure 
2. 




L 



• • * 

Figure 2. The • are vertices of Vl and the (^) are vertices of V* L . Here u and 
v* are diagonally adjacent. 

Call v G V L UV L nice if 

(i) each of the four natural L/2 x L/2 subsquares of S v has between 0.24L 2 
and 0.26L 2 cities ; 

(ii) iVi(g„) > 0.99L 2 ; 
(hi) len(G v ) < c L 

where the constant cl will be specified later. Now call v G Vl good if v and 
its four diagonally adjacent midpoints v* G V* L are all nice. 

Using hypothesis Q, we can choose L and cl such that P(v is good) > 
po, where po is the value in Proposition |3j Let us emphasize that L and cl 
remain fixed for the rest of the argument; we are never going to say "let 
L -> oo". 

The construction above ensures the 1-dependent property for the process 
{v is good}^-^ , because if v\ and i>3 are not adjacent in £l then squares 
S v * and S v * (centered at midpoints v * of edges at Vi) cannot overlap. 

Observe the following consequence of parts (i) and (ii) of the definition 
of nice above. If v* G V* L is a midpoint of an edge at v G Vl and if both v 
and v* are nice, then the largest components of Q v and of Q v * must have a 
common city in the subsquare where the two squares intersect. So for any 
cities £ and £* in the largest components of Q v and of Q v *, there exists a 
route in the underlying network from £ to £* for which each road used is in 
Qv or Q v * . (Note this is where the "regardless of ... " in the definition of Q 
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comes into play; this route exists, under the stated assumptions, regardless 
of other aspects of the configuration of all cities). 

Taking into account the definition of good site, we deduce the following. 
Consider a good path vq, v\, . . . , v q in Vl, and write (v*) for the midpoints 
of edges along that path. Then, for any cities £o and £ 9 in the largest 
components of Q VQ and Q v , there exists a route from £o to £ 9 in which each 
road used is in some Q v . or Q v *. Such a route has total length bounded 
by (2q + 1)cl, by (iii). Now for an oriented-up path we must have q < 
d(vo, v q )/L\ also d(vo,v q ) < d(xo,x q ) + Ly/2. Recalling our notation 
for network distance, we have shown 

Lemma 4 Let £, £' be cities in the PP, in the L x L squares centered at 
vertices v,v' oJVl, and in the largest components ofQ v and ofQ v i. If there 
exists an oriented-up good path in Vl from v to v' then 

< (4 + 2L- 1 d(£,0)cL. 



2.3 Outline of argument 

In outline, the argument for Theorem [T] is rather clear. 

(i) Given a small square A at the origin and the translated square z + A 
for large \z\, Lemma [4] and Proposition [3] allow us to find two routes, one 
through some city £ near A and the other through some city £' near z + A, 
which can be oriented so as to meet somewhere in between, and so create a 
route from £ to £' of length 0(|z|). 

(ii) We want to make a route from cities in A to £, with route-length 0(1). 
We do this by constructing in Vl a circuit, with length O(l), of good sites 
including the site v near £ and encircling the origin. This implies existence 
of a length 0(1) encircling circuit through £ in the underlying network. 
Connectivity now ensures a length O(l) route from cities in A to £. 

Saying this carefully with the specific "tilted square grid" construction 
of section [2^2] leads to technical difficulties in making paths meet. We deal 
with this difficulty by creating paths which can be oriented in more narrowly 
specified directions, by using a number of "affine lattices" described next. 
Then part (i) is formalized in section 2.5 and part (ii) in section 2.6 Where 



the section |2.2| arguments carry over straightforwardly to the "affine lattice" 
setting we will omit the details. 
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2.4 Affine lattices 





Figure 3. The six sectors (left) and the affine lattice C a 
with sector a (right). 



(V a ,£ a ) associated 



Divide the half-plane into six sectors, labeled a—f, as on the left of Figure 3. 
Associate with each sector an "affine lattice", written as e.g. C a = (V a ,£ a ), 
where the case "a" is illustrated on the right of Figure 3. For a, b, c the 
vertex-set is V = {(ni,ri2) : rn G Z, n\ + ni divisible by 3}. The edge- 
set illustrated for £ a has edges from (77.1,712) to (n\ + 0,ri2 + 3) and to 
[n\ + 1,^2 + 3), where the added vectors (0,3) and (1,2) are the points 
(Figure 3, left) defining the boundary of sector a; analogously for C b and 
C c . The affine lattices & ,C e , C d can be viewed as the top-bottom reflections 
ofjC a ,£ b ,£ c , and their vertex set is {(ni, 712) : n« G Z,ni— ri2 divisible by 3}. 

We will describe a construction involving C a ; the same construction can 
be applied to each of these lattices. 

We can obtain C a via a linear transformation applied to the tilted square 
grid C = (V, £) of section pTTj As at the start of section 2.2 scale C a by L 
to get a lattice C a L = (V a L ,£ a L ). 



We now repeat the construction in section 2.2 replacing squares by their 



images (parallelograms) under the linear transformation. This involves some 
obvious changes to constants, which we will not write out in detail: for 
instance the "L 2 " in the definition of nice is replaced by L 2 times the area 
of the appropriate basic parallelogram. Making the analogous definition of 
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v £ VI being good for C a L , we can copy the argument for Lemma [4] to obtain 

Lemma 5 There exists a constant A such that the following property, stated 
for L a L , holds for each of the six affine lattices. Let £, £' be cities in the PP, 
in the scaled parallelograms S% and S 1 ", centered at vertices v and v' of VI, 
and in the largest components of the corresponding Q% and Q",. If there 
exists an oriented good path in V a L from v to v' then 

<A(l + d(t,?)). 

Remarks on Lemma^ (i) In the bound we are now supressing the depen- 
dence on L, which is fixed. 

(ii) A path in (say) L\ is oriented-right if each edge is a translate of one 
of the vectors bounding the sector in Figure 3, and oriented-left if it is a 
reversal of such, and oriented if either oriented-right or oriented-left. 

(iii) Hypothesis Q, the asymptotic essential connectedness property, was 
stated for unit squares scaled by L, whereas to prove Lemma [5] we need it 
to hold for a fixed parallelogram <0 scaled by L. But this is true because for 
any e we can find a finite collection of (overlapping) squares inside <^> whose 
union has area at least (1 — e)x the area of <0>. 

Proposition [3] transfers directly to the affine lattice setting. To state the 
conclusion, recall we are interested in the distance from site (0, 0) to a nearby 
site in (say) L\ which is in a doubly-infinite oriented path. We measure 
distance by considering the nearest such site in a specified direction along 
either the reverse diagonal \[n\,n<i) : ni = — n\\ or the forward diagonal 
{(ni,n2) : ni = ni}, choosing a diagonal which does not bisect the sector 
under consideration. Write the Euclidean distance from (0, 0) to the nearest 
distinct such site as e.g. T£", the arrow indicating the diagonal and direction 
in which we are searching. Call the site . There is a finite collection T 
of random variables such as arising from different choices of sector and 
diagonal direction, and Proposition [3] implies there exist finite constants n' k 
such that for each T £ T, 

ET k < 4, 1 < k < oo. (6) 

2.5 Global routes 

The construction is illustrated in Figure 4. Consider the origin (0, 0) and 
some site v in the scaled square lattice 7? L . Then v is in some sector, say 
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sector b. Consider the two adjacent sectors, in this case a and c. In the 
figure, vo = vS^ is the closest site to (0, 0) along the diagonal \ which 
is in a doubly-infinite oriented path in CI, and v\ is the closest site to v 
along the diagonal \ which is in a doubly-infinite oriented path in £° L . 
One can always choose the diagonal and direction so that, as in Figure 4, 
for each edge eo in the path from v$ toward the crossing point z and each 
edge e\ in the path from v\ toward the crossing point z, the edges are at 
angles bounded below ir/2. So the length of the path from vq to v\ via 
the crossing point, interpreting the lattice edges as line segments in M 2 , is 
0{\v\ + \vq\ + d(v, vi)), by elementary geometry considerations. Then as in 
the argument for Lemmas [4] and [5j we can associate with this lattice path a 
route in the underlying network, of length at most Cf,x lattice path length. 
This construction leads to the following bound, discussed below. 




Figure 4. The "global" part of route construction. 



Proposition 6 Given any v £ J? L , there exist sites v$,v\ £ T? L and sectors, 
say a and c, such that vq (resp. v\) is a good site for C a L (resp. C C L ), and for 
any cities £, f in the scaled parallelograms and S% centered at vertices 
v o °f Vl an d v i °f VL> an d i> n the largest components of the corresponding 
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O < B(l + d((0, 0),vq) + ui) + H). 

i^ere B is a constant and 

each of d((Q,Q),vo) and d(v,v±) is distributed as some T G T. (7) 

Discussion. One notational detail is that the affine lattices £^ only 
contain as vertices 1/3 of the vertices of T? L , so i> may be in one of the two 
translated copies of C C L which cover the other vertices; this does not affect 
the argument. 

An issue of more substance is to check that, where the two lattice paths 
cross, we can connect one associated route in the underlying network to the 
other route. To check this, consider Figure 2. Any point z within an edge 
of the lattice is between some v G Vl and a diagonally adjacent v* G V* L . 
Suppose, as in Figure 2, that v* is closer than v to z. Properties (i) and (ii) 
of nice imply that S v * contains at most 1.04L 2 cities, and that the largest 
component of Q v * contains at least 0.99L 2 cities. Transforming to the affine 
lattice setting, there are numerical constants ai,(3i = (0.99/1.04)aj, i = 
a, . . . , /, and the crossing point z in Figure 4 is in some parallelogram S a 
for which 

(i) S a contains at most a a L 2 cities; 

(ii) the largest component of the associated Q a contains at least j3 a L 2 cities; 

(iii) z is on the line from the center to a corner of S a , closer to the center 
than to the corner; 

and similarly for another parallelogram S c . These constraints force the 
intersection of the largest components of Q a and of Q c to be non-empty, 
which is what is required to link the routes. 

2.6 Local routes 

Recall that each T £ T is the distance from the origin to a nearby site on 
the diagonal, say vt- We will prove 

Proposition 7 There exists a random variable D, with all moments finite, 
such that for each T G T, each city (if any) £ in the associated scaled 
parallelogram S VT , and each city £* in the unit square centered at (0,0), 

£(C,0<D. 
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Proof. Take one T £ T, say T(a, \). Take a sector that is two sec- 
tors away away from a, in this case sector c, and consider (Figure 5) the 
four doubly-infinite oriented paths associated with T(a, \), T(a, \), T(c, \ 
),T(c, \). These define a path circuit around the origin, of length at most 
where S := T(a, \) + T(a, \) + T(c, \) + T(c, \) and B l (and £ 2 
below) are constants. As argued below Proposition [6j we can construct an 
associated route in the network, of length at most B2T, for some B2. 




Figure 5. Constructing a route encircling the origin. 

Looking back at Figure 2, route construction relative to the square grid, 
consider a path in Cl passing through v but not an adjacent (in Cl) vertex 
w; the associated network route in this region cannot come closer to w than 
distance Lj\f2. It follows that the route constructed above encircles the 
origin and does not come closer to the origin than 5L, for some constant 
5 > 0,. By taking L large we may assume the route encircles the unit square 
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centered at the origin. The encircling route, being at length at most B2T1, 
must stay within the disc of radius By connectivity of the network, 

from any city £* in the unit square there is a route which meets the encircling 
route and then continues to v£. This route stays within the disc of radius 
B2T1. We can therefore bound its length by the r.v. D(i?2S) in Lemma [8] 
below; combining ^ with the conclusion of Lemma[8]establishes Proposition 

m - 

At the end of the proof we used the following straightforward lemma. 

Lemma 8 For the PP (with a city planted at the origin) and for a > 0, 
define a r.v. D{a) to be the sum of Euclidean distances between all pairs of 
cities within the disc of radius a. Then for any r.v. £ with all moments 
finite, the r.v. D(T,) has all moments finite. 

2.7 Completing the proof 

Theorem [T] and Corollary [2] follow from Propositions [6] and [7J and (|6] [7| , as 
we now explain. 

Consider cities and £** in the unit squares centered centered at the 
origin and at v E T? L . Proposition [Tj applied at the origin and at v, together 
with Proposition [6j implies 

i(C,Cl <B(l + \v\+T 1 + T 2 )+D 1 +D 2 (8) 

where T\ and T 2 satisfy ^ and D\ and D2 have the distribution of D in 
Proposition [7j So, writing U for the unit square, 

KC,C*)<{B(l + \v\+T 1 +T 2 ) + D 1 + D 2 ) mN 2 

t*£U£**£v+U 

where N\ (resp. N2) is the number of points of the PP in U (resp. v + U). 
The random variables on the right side have all moments finite, uniformly 
in v, and so 

E E E *(r,r*) = o(M)asM->oo. 

This is enough to establish Theorem [l] (here v is restricted to H? L , but the 
bound holds for all sufficiently large L) . Corollary [2] is derived in the same 
way after first taking the fc'th power in (pi). 



15 



3 The relative neighborhood graph 

The definition of the RNG (relative neighborhood graph) can be rephrased 
as follows. For two points v, w in the plane, define the lune A VyW as the 
intersection of the two discs of radii d(v, w) centered at v and at w. Then 



Figure 6. The relative neighborhood graph on a realization of 500 random 
cities. 



there is a road between two cities x, y if 
and only if the lune A ViW contains no other city. (9) 

It is elementary [11] that the RNG contains the minimal spanning tree and 
is therefore connected (see section 3.4 for the infinite case). 

Proposition 9 The RNG on a PP satisfies the asymptotic essential con- 
nectedness property O); and so by Theorem [7] it has the linearity property 

&■ 

We'll discuss relations with previous results, after the proof. 
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3.1 Proof of Proposition [9] 

Fix L. We start with a property for "arbitrary" city positions - this means 
"deterministic, but in general position" . 

Lemma 10 Consider an arbitrary finite configuration of cities in the Lx L 
square. Consider the network Ql from |3p; that is, the roads within the 
L x L square that are in the RNG for this configuration together with every 
locally finite configuration of cities outside the square. Suppose Ql is n °t 
connected. Write A(v) for the distance from v to the boundary of the L x L 
square. Then, for any initial city vo, there exists a sequence of distinct cities 
vq,vi,... ,v m , for some m>0, such that 

(i) d(v ,vi) > d(v 1 ,v 2 ) > ... > d(v m -i,v m ) 

(ii) If m = then every city within distance A(t>o) from vq is in the same 
component as vq 

(Hi) if m > 1 then A(v m ) < d(vo,vx). 

Proof. Write components for the connected components of Ql- For each 
city v, consider the nearest city (w, say) in some different component than 
v. If the lune A v>w intersects the boundary of the L x L square, call v a 
terminal city. Otherwise, write w = 9{v). Because (v,w) is not an edge of 
the RNG, the lune A VtW must contain some other city, and (by definition 
of w as closest) any such other city z must be in the same component as 
v; note also d(z,w) < d(v,w). Thus if w is not a terminal city then 9(w) 
satisfies d(w,9(w)) < d(w,z) < d(v,w). Finally, note that if w = 6{v) is 
a terminal city, then the lune A W)U intersects the boundary of the L x L 
square, where u is the closest city to w in a different component than w\ 
since d(w, u) < d{w, v) it must be that w is within distance d(v, w) from the 
boundary. 

Combining the properties established above, we see that, for any vo, 
the sequence v\ = #(t>o),i>2 = 9{v\), . . . , V{ = 6{vi-\), . . . continued until 
reaching some terminal v m , satisfies the assertions of the lemma. ■ 

We next give a lemma about Poisson points, not involving the RNG. 

Lemma 11 Consider the points S of a PP in a L x L square. 

(a) There is a constant c* such that, with probability — ► 1 as L — > oo, the 
length of the longest edge of the minimal spanning tree on the points 3 is at 
most c* log L. 

(b) Fix do and n. The probability that there exists a sequence £o) £i) • • • , £n of 
distinct points in H such that do > <i(£o>£i) — ^(£ij£2) > • ■ ■ > d(£n-i>£n) 
is at most — , . 
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Proof. Part (a) is a weaker version of the known precise asymptotics [15] 
for longest MST edge length. For (b), if we consider the PP on the whole 
plane, and require that £q, but not the other £j, be in the Lx L square, then 
the expected number of sequences satisfying the condition in (b) equals 

I? If... I ta^ta*...^. 

J J J do>ri>r2..->r n >0 

so this is an upper bound on the desired probability. By symmetry of the 
n! orderings, 

. . I r\dr\ r 2 dr 2 . . . r n dr n = — IT / ndn = — I I 

Jdo>ri>r 2 ...>r„>0 n! f=V° Tl\ \2J 

and the result follows. ■ 

We now start to combine the deterministic and random ingredients. 
Write Ql for the subnetwork ^ of the RNG on the points 3 of a PP in 

Lemma 12 Fix do and n. The probability of the following event is at most 

L 2 -K n dl n 

There exist vertices £' and in different components ofQi, with A(£') > 
d (n+ 1), such that d(f ,£") < d Q . 

Proof. It is enough to show the event here implies the event in Lemma 
[11(b). We write (i ,ii,iii) for the assertions of Lemma flOl 



Suppose such vertices exist. Then Ql is not connected, and we 

may consider the sequence £' = Vq, vx, . . . , v m given by Lemma 10 By (ii) 
we have m > 1. Because d(vo,vi) < d(£',£") < do, monotonicity in (i) 
implies <i(fo, u m ) < dom. Because A(uo) < d{vQ.v m ) + A(u m ), we can apply 
(hi) to get A.(vo) < do(m + 1). But by hypothesis A(uo) > do(n + 1), so we 
have shown m > n. So the event in Lemma [XTj^b) occurs. ■ 

Proof of Proposition [9] Set 

n = n(L) = [L 1/2 \ , d = d {L) = c* log L, M = M(L) = L — 2(1 + n(L))d 

and suppose L is sufficiently large that M > 1. As above, write C/z, for the 
subnetwork ([3| of the RNG on the points 3 of a PP in [0, L] 2 . Consider the 
events 

(Al): There exist vertices £' and in different components of Ql, with 
A(0 > d (n + 1), such that < do. 
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{Bl): In the minimal spanning tree on the points of 3 within the M x M 



square concentric with [0, L\ , the longest edge is longer than d { 



Lemma [TT[a) implies P(B£) — > as L — > oo. Lemma 12 



o- 



says 



t 2 n j2n 

p(j4l) < t 7T Qo o as L — >• oo, 
n! 



the convergence by a routine use of Stirling's formula and the definitions of 
do(L) and n{L). Now we assert 

if neither Al not then all the cities in the concentric 

M x M square are in the same component of Ql- (10) 

For if the implication were false, then some edge (£', £") of the MST (on 
cities in the M x M square) links cities in different components of Ql, and 
because B L fails we have d(f',f") < do- But A(f) > (L-M)/2 = d (n + l) 
by definition of M, so ^ holds. 



But assertion (10) implies 

N° L < N L \ M + N l 1 AlUBl 

where N® is the number of cities inside [0,L] 2 that are not in the largest 
component of Ql, Nl is the total number of cities inside [0, L] 2 , and N l \m 
is the number of cities inside [0,L] 2 but not inside the concentric M X M 
square. Taking expectation, 

L- 2 ENl < 1 - ^+E(L- 2 N L U l ub l ) 

and the right side — > by definition of M(L), uniform integrability of 
{L~ 2 Nl) and the fact P{A L U B L ) -»■ 0. ■ 



3.2 Proximity graphs 

There is a general class of proximity graphs [11 which as defined at ([9| 
but with the lune A v>w replaced by some specified subset of the lune; such 
networks on a PP a priori have more roads than the RNG and therefore can 
only have shorter network distances. So the linearity property remains true 
for this class of networks. 

For any finite spatial network one can define its stretch as the maximum, 
over city-pairs (x, y), of the ratio £(x, y)/d(x, y). Now consider a network, in 
the sense of a deterministic rule for assigning roads to any finite configuration 
of cities. Such a network is called [13] a t-spanner if the stretch is always 
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bounded by the constant t. Clearly the property of being a spanner (that 
is, a t-spanner for some t) is stronger than our linearity property (|2]). It 
is a remarkable result [12] that the Delaunay triangulation is a i-spanner 
^ or * = 3 cos 7 7r/6 ~ 2.42. Consequently, while our Theorem jlj can easily be 
applied to the Delaunay triangulation over the PP, it does not yield any 
new result. On the other hand the RNG and related proximity graphs are 
known not to be spanners, and (when constructed over n random points in 
a square) the expectation of stretch increases slowly to infinity as n — > oo. 
See j6]. So our results do say something new about this model. 

3.3 Modifications of disconnected networks 

Our results suggest the following general program, which we have not pur- 
sued. Starting with (for instance) the supercritical geometric random graph, 
one might guess that any reasonable ad hoc scheme for adding edges to cre- 
ate a connected network would yield a network with the linearity property, 
and one could seek to verify this for any given scheme by using Theorem 
[T] Similarly, starting from a more general disconnected network, one can 
always create a connected network by adding all edges (v,w) of the RNG 
such that v and uu are in different components of the original network; given 
any particular original network one could seek to modify the proof of Propo- 
sition [9] to show that the asymptotic essential connectedness property Q 
holds and thereby establish the linearity property. 

3.4 Remarks about the minimum spanning tree 

On a general infinite (but locally finite) configuration of points, the analog 
of the MST is the minimum spanning forest. It is true [3], but not obvious, 
that when applied to the PP one gets a single tree. This implies in particular 
that the RNG over the PP is connected. 

It is widely believed in the statistical physics literature (see e.g. [T71 [18] 
for references) that for the MST over the PP, quantities such as ET 2 should 
grow as some power |z| 7 where 7 ~ 1.23. But it is not clear to us what has 
been rigorously proved. Intuitively, no tree network can have the linearity 
property Q, and proving this would be a minor research project. 
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